Overview

Dataset statistics

Number of variables18
Number of observations8950
Missing cells314
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 MiB
Average record size in memory144.0 B

Variable types

NUM17
CAT1

Reproduction

Analysis started2020-05-21 07:13:45.448083
Analysis finished2020-05-21 07:14:40.269968
Duration54.82 seconds
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

oneoff_purchases is highly correlated with purchasesHigh correlation
purchases is highly correlated with oneoff_purchasesHigh correlation
minimum_payments has 313 (3.5%) missing values Missing
cust_id has unique values Unique
purchases has 2044 (22.8%) zeros Zeros
oneoff_purchases has 4302 (48.1%) zeros Zeros
installments_purchases has 3916 (43.8%) zeros Zeros
cash_advance has 4628 (51.7%) zeros Zeros
purchases_frequency has 2043 (22.8%) zeros Zeros
oneoff_purchases_frequency has 4302 (48.1%) zeros Zeros
purchases_installments_frequency has 3915 (43.7%) zeros Zeros
cash_advance_frequency has 4628 (51.7%) zeros Zeros
cash_advance_trx has 4628 (51.7%) zeros Zeros
purchases_trx has 2044 (22.8%) zeros Zeros
payments has 240 (2.7%) zeros Zeros
prc_full_payment has 5903 (66.0%) zeros Zeros

Variables

cust_id
Categorical

UNIQUE

Distinct count8950
Unique (%)100.0%
Missing0
Missing (%)0.0%
Memory size69.9 KiB
C14537
 
1
C11221
 
1
C14939
 
1
C11027
 
1
C16106
 
1
Other values (8945)
8945
ValueCountFrequency (%) 
C145371< 0.1%
 
C112211< 0.1%
 
C149391< 0.1%
 
C110271< 0.1%
 
C161061< 0.1%
 
C106211< 0.1%
 
C160821< 0.1%
 
C150991< 0.1%
 
C113621< 0.1%
 
C168491< 0.1%
 
Other values (8940)894099.9%
 

Length

Max length6
Median length6
Mean length6
Min length6

balance
Real number (ℝ≥0)

Distinct count8871
Unique (%)99.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1564.4748276781006
Minimum0.0
Maximum19043.13856
Zeros80
Zeros (%)0.9%
Memory size69.9 KiB

Quantile statistics

Minimum0
5-th percentile8.81451835
Q1128.2819155
median873.385231
Q32054.140036
95-th percentile5909.111808
Maximum19043.13856
Range19043.13856
Interquartile range (IQR)1925.85812

Descriptive statistics

Standard deviation2081.531879
Coefficient of variation (CV)1.330498799
Kurtosis7.6747513
Mean1564.474828
Median Absolute Deviation (MAD)799.865197
Skewness2.393386043
Sum14002049.71
Variance4332774.965
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0800.9%
 
1100.9410721< 0.1%
 
40.0744841< 0.1%
 
2093.8446561< 0.1%
 
179.7657081< 0.1%
 
12.6549031< 0.1%
 
1893.7048511< 0.1%
 
1571.2186951< 0.1%
 
31.2856081< 0.1%
 
1772.3234911< 0.1%
 
Other values (8861)886199.0%
 
ValueCountFrequency (%) 
0800.9%
 
0.0001991< 0.1%
 
0.0011461< 0.1%
 
0.0012141< 0.1%
 
0.0012891< 0.1%
 
ValueCountFrequency (%) 
19043.138561< 0.1%
 
18495.558551< 0.1%
 
16304.889251< 0.1%
 
16259.448571< 0.1%
 
16115.59641< 0.1%
 

balance_frequency
Real number (ℝ≥0)

Distinct count43
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.8772707255865921
Minimum0.0
Maximum1.0
Zeros80
Zeros (%)0.9%
Memory size69.9 KiB

Quantile statistics

Minimum0
5-th percentile0.272727
Q10.888889
median1
Q31
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)0.111111

Descriptive statistics

Standard deviation0.2369040027
Coefficient of variation (CV)0.2700466296
Kurtosis3.092369622
Mean0.8772707256
Median Absolute Deviation (MAD)0
Skewness-2.023265519
Sum7851.572994
Variance0.05612350649
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1621169.4%
 
0.9090914104.6%
 
0.8181822783.1%
 
0.7272732232.5%
 
0.5454552192.4%
 
0.6363642092.3%
 
0.4545451721.9%
 
0.3636361701.9%
 
0.2727271511.7%
 
0.1818181461.6%
 
Other values (33)7618.5%
 
ValueCountFrequency (%) 
0800.9%
 
0.090909670.7%
 
0.180.1%
 
0.11111150.1%
 
0.12590.1%
 
ValueCountFrequency (%) 
1621169.4%
 
0.9090914104.6%
 
0.9550.6%
 
0.888889530.6%
 
0.875570.6%
 

purchases
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count6203
Unique (%)69.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1003.2048335195531
Minimum0.0
Maximum49039.57
Zeros2044
Zeros (%)22.8%
Memory size69.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q139.635
median361.28
Q31110.13
95-th percentile3998.6195
Maximum49039.57
Range49039.57
Interquartile range (IQR)1070.495

Descriptive statistics

Standard deviation2136.634782
Coefficient of variation (CV)2.129809098
Kurtosis111.3887709
Mean1003.204834
Median Absolute Deviation (MAD)361.28
Skewness8.144269065
Sum8978683.26
Variance4565208.191
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0204422.8%
 
45.65270.3%
 
150160.2%
 
60160.2%
 
100130.1%
 
300130.1%
 
200130.1%
 
450120.1%
 
600100.1%
 
70100.1%
 
Other values (6193)677675.7%
 
ValueCountFrequency (%) 
0204422.8%
 
0.014< 0.1%
 
0.051< 0.1%
 
0.241< 0.1%
 
0.71< 0.1%
 
ValueCountFrequency (%) 
49039.571< 0.1%
 
41050.41< 0.1%
 
40040.711< 0.1%
 
38902.711< 0.1%
 
35131.161< 0.1%
 

oneoff_purchases
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count4014
Unique (%)44.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean592.4373709497207
Minimum0.0
Maximum40761.25
Zeros4302
Zeros (%)48.1%
Memory size69.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median38
Q3577.405
95-th percentile2671.094
Maximum40761.25
Range40761.25
Interquartile range (IQR)577.405

Descriptive statistics

Standard deviation1659.887917
Coefficient of variation (CV)2.801794753
Kurtosis164.187572
Mean592.4373709
Median Absolute Deviation (MAD)38
Skewness10.04508288
Sum5302314.47
Variance2755227.898
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0430248.1%
 
45.65460.5%
 
50170.2%
 
200150.2%
 
60130.1%
 
100130.1%
 
70120.1%
 
150120.1%
 
1000120.1%
 
250110.1%
 
Other values (4004)449750.2%
 
ValueCountFrequency (%) 
0430248.1%
 
0.0170.1%
 
0.022< 0.1%
 
0.051< 0.1%
 
0.241< 0.1%
 
ValueCountFrequency (%) 
40761.251< 0.1%
 
40624.061< 0.1%
 
34087.731< 0.1%
 
33803.841< 0.1%
 
26547.431< 0.1%
 

installments_purchases
Real number (ℝ≥0)

ZEROS

Distinct count4452
Unique (%)49.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean411.0676446927374
Minimum0.0
Maximum22500.0
Zeros3916
Zeros (%)43.8%
Memory size69.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median89
Q3468.6375
95-th percentile1750.0875
Maximum22500
Range22500
Interquartile range (IQR)468.6375

Descriptive statistics

Standard deviation904.3381152
Coefficient of variation (CV)2.199973963
Kurtosis96.57517753
Mean411.0676447
Median Absolute Deviation (MAD)89
Skewness7.299119909
Sum3679055.42
Variance817827.4266
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0391643.8%
 
100140.2%
 
300140.2%
 
200140.2%
 
150120.1%
 
125110.1%
 
7590.1%
 
22580.1%
 
35080.1%
 
45080.1%
 
Other values (4442)493655.2%
 
ValueCountFrequency (%) 
0391643.8%
 
1.951< 0.1%
 
4.441< 0.1%
 
4.81< 0.1%
 
6.331< 0.1%
 
ValueCountFrequency (%) 
225001< 0.1%
 
15497.191< 0.1%
 
14686.11< 0.1%
 
13184.431< 0.1%
 
12738.471< 0.1%
 

cash_advance
Real number (ℝ≥0)

ZEROS

Distinct count4323
Unique (%)48.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean978.8711124654749
Minimum0.0
Maximum47137.211760000006
Zeros4628
Zeros (%)51.7%
Memory size69.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31113.821139
95-th percentile4647.169122
Maximum47137.21176
Range47137.21176
Interquartile range (IQR)1113.821139

Descriptive statistics

Standard deviation2097.163877
Coefficient of variation (CV)2.142431062
Kurtosis52.89943411
Mean978.8711125
Median Absolute Deviation (MAD)0
Skewness5.166609074
Sum8760896.457
Variance4398096.325
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0462851.7%
 
1286.3562071< 0.1%
 
3816.4702661< 0.1%
 
2495.2989261< 0.1%
 
748.2417271< 0.1%
 
1572.4927191< 0.1%
 
5425.9128071< 0.1%
 
9553.9559061< 0.1%
 
341.3608761< 0.1%
 
1424.4426021< 0.1%
 
Other values (4313)431348.2%
 
ValueCountFrequency (%) 
0462851.7%
 
14.2222161< 0.1%
 
18.0427681< 0.1%
 
18.1179671< 0.1%
 
18.1234131< 0.1%
 
ValueCountFrequency (%) 
47137.211761< 0.1%
 
29282.109151< 0.1%
 
27296.485761< 0.1%
 
26268.699891< 0.1%
 
26194.049541< 0.1%
 

purchases_frequency
Real number (ℝ≥0)

ZEROS

Distinct count47
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.49035054837988823
Minimum0.0
Maximum1.0
Zeros2043
Zeros (%)22.8%
Memory size69.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10.083333
median0.5
Q30.916667
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)0.833334

Descriptive statistics

Standard deviation0.4013707474
Coefficient of variation (CV)0.8185383879
Kurtosis-1.638630948
Mean0.4903505484
Median Absolute Deviation (MAD)0.416667
Skewness0.06016423586
Sum4388.637408
Variance0.1610984768
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1217824.3%
 
0204322.8%
 
0.0833336777.6%
 
0.9166673964.4%
 
0.53954.4%
 
0.1666673924.4%
 
0.8333333734.2%
 
0.3333333674.1%
 
0.253453.9%
 
0.5833333163.5%
 
Other values (37)146816.4%
 
ValueCountFrequency (%) 
0204322.8%
 
0.0833336777.6%
 
0.090909430.5%
 
0.1270.3%
 
0.111111180.2%
 
ValueCountFrequency (%) 
1217824.3%
 
0.9166673964.4%
 
0.909091280.3%
 
0.9240.3%
 
0.888889180.2%
 

oneoff_purchases_frequency
Real number (ℝ≥0)

ZEROS

Distinct count47
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.202457683575419
Minimum0.0
Maximum1.0
Zeros4302
Zeros (%)48.1%
Memory size69.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0.083333
Q30.3
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)0.3

Descriptive statistics

Standard deviation0.2983360652
Coefficient of variation (CV)1.473572452
Kurtosis1.161845601
Mean0.2024576836
Median Absolute Deviation (MAD)0.083333
Skewness1.535612784
Sum1811.996268
Variance0.08900440779
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0430248.1%
 
0.083333110412.3%
 
0.1666675926.6%
 
14815.4%
 
0.254184.7%
 
0.3333333554.0%
 
0.4166672442.7%
 
0.52352.6%
 
0.5833331972.2%
 
0.6666671671.9%
 
Other values (37)8559.6%
 
ValueCountFrequency (%) 
0430248.1%
 
0.083333110412.3%
 
0.090909560.6%
 
0.1390.4%
 
0.111111260.3%
 
ValueCountFrequency (%) 
14815.4%
 
0.9166671511.7%
 
0.9090914< 0.1%
 
0.91< 0.1%
 
0.8888892< 0.1%
 

purchases_installments_frequency
Real number (ℝ≥0)

ZEROS

Distinct count47
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.3644373415642458
Minimum0.0
Maximum1.0
Zeros3915
Zeros (%)43.7%
Memory size69.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0.166667
Q30.75
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)0.75

Descriptive statistics

Standard deviation0.3974477797
Coefficient of variation (CV)1.090579187
Kurtosis-1.398632185
Mean0.3644373416
Median Absolute Deviation (MAD)0.166667
Skewness0.509201165
Sum3261.714207
Variance0.1579647376
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0391543.7%
 
1133114.9%
 
0.4166673884.3%
 
0.9166673453.9%
 
0.8333333113.5%
 
0.53103.5%
 
0.1666673053.4%
 
0.6666672923.3%
 
0.752913.3%
 
0.0833332753.1%
 
Other values (37)118713.3%
 
ValueCountFrequency (%) 
0391543.7%
 
0.0833332753.1%
 
0.090909120.1%
 
0.160.1%
 
0.11111190.1%
 
ValueCountFrequency (%) 
1133114.9%
 
0.9166673453.9%
 
0.909091250.3%
 
0.9190.2%
 
0.888889280.3%
 

cash_advance_frequency
Real number (ℝ≥0)

ZEROS

Distinct count54
Unique (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.13514420033519556
Minimum0.0
Maximum1.5
Zeros4628
Zeros (%)51.7%
Memory size69.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30.222222
95-th percentile0.583333
Maximum1.5
Range1.5
Interquartile range (IQR)0.222222

Descriptive statistics

Standard deviation0.2001213881
Coefficient of variation (CV)1.480798937
Kurtosis3.334734328
Mean0.1351442003
Median Absolute Deviation (MAD)0
Skewness1.828686266
Sum1209.540593
Variance0.04004856999
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0462851.7%
 
0.083333102111.4%
 
0.1666677598.5%
 
0.255786.5%
 
0.3333334394.9%
 
0.4166672733.1%
 
0.52152.4%
 
0.5833331421.6%
 
0.6666671251.4%
 
0.090909700.8%
 
Other values (44)7007.8%
 
ValueCountFrequency (%) 
0462851.7%
 
0.083333102111.4%
 
0.090909700.8%
 
0.1390.4%
 
0.111111290.3%
 
ValueCountFrequency (%) 
1.51< 0.1%
 
1.251< 0.1%
 
1.1666672< 0.1%
 
1.1428571< 0.1%
 
1.1251< 0.1%
 

cash_advance_trx
Real number (ℝ≥0)

ZEROS

Distinct count65
Unique (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.2488268156424582
Minimum0
Maximum123
Zeros4628
Zeros (%)51.7%
Memory size69.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q34
95-th percentile15
Maximum123
Range123
Interquartile range (IQR)4

Descriptive statistics

Standard deviation6.824646744
Coefficient of variation (CV)2.100649598
Kurtosis61.64686248
Mean3.248826816
Median Absolute Deviation (MAD)0
Skewness5.721298203
Sum29077
Variance46.57580318
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0462851.7%
 
18879.9%
 
26206.9%
 
34364.9%
 
43844.3%
 
53083.4%
 
62462.7%
 
72052.3%
 
81711.9%
 
101501.7%
 
Other values (55)91510.2%
 
ValueCountFrequency (%) 
0462851.7%
 
18879.9%
 
26206.9%
 
34364.9%
 
43844.3%
 
ValueCountFrequency (%) 
1233< 0.1%
 
1101< 0.1%
 
1071< 0.1%
 
931< 0.1%
 
801< 0.1%
 

purchases_trx
Real number (ℝ≥0)

ZEROS

Distinct count173
Unique (%)1.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.709832402234637
Minimum0
Maximum358
Zeros2044
Zeros (%)22.8%
Memory size69.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median7
Q317
95-th percentile57
Maximum358
Range358
Interquartile range (IQR)16

Descriptive statistics

Standard deviation24.85764911
Coefficient of variation (CV)1.689866236
Kurtosis34.79310026
Mean14.7098324
Median Absolute Deviation (MAD)7
Skewness4.630655266
Sum131653
Variance617.9027193
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0204422.8%
 
16677.5%
 
125706.4%
 
23794.2%
 
63523.9%
 
33143.5%
 
42853.2%
 
72753.1%
 
82673.0%
 
52673.0%
 
Other values (163)353039.4%
 
ValueCountFrequency (%) 
0204422.8%
 
16677.5%
 
23794.2%
 
33143.5%
 
42853.2%
 
ValueCountFrequency (%) 
3581< 0.1%
 
3471< 0.1%
 
3441< 0.1%
 
3091< 0.1%
 
3081< 0.1%
 

credit_limit
Real number (ℝ≥0)

Distinct count205
Unique (%)2.3%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean4494.449450364621
Minimum50.0
Maximum30000.0
Zeros0
Zeros (%)0.0%
Memory size69.9 KiB

Quantile statistics

Minimum50
5-th percentile1000
Q11600
median3000
Q36500
95-th percentile12000
Maximum30000
Range29950
Interquartile range (IQR)4900

Descriptive statistics

Standard deviation3638.815725
Coefficient of variation (CV)0.8096243524
Kurtosis2.836655932
Mean4494.44945
Median Absolute Deviation (MAD)1800
Skewness1.522464005
Sum40220828.13
Variance13240979.88
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
30007848.8%
 
15007228.1%
 
12006216.9%
 
10006146.9%
 
25006126.8%
 
40005065.7%
 
60004635.2%
 
50003894.3%
 
20003714.1%
 
75002773.1%
 
Other values (195)359040.1%
 
ValueCountFrequency (%) 
501< 0.1%
 
15050.1%
 
2003< 0.1%
 
300140.2%
 
4003< 0.1%
 
ValueCountFrequency (%) 
300002< 0.1%
 
280001< 0.1%
 
250001< 0.1%
 
230002< 0.1%
 
225001< 0.1%
 

payments
Real number (ℝ≥0)

ZEROS

Distinct count8711
Unique (%)97.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1733.1438520248043
Minimum0.0
Maximum50721.483360000006
Zeros240
Zeros (%)2.7%
Memory size69.9 KiB

Quantile statistics

Minimum0
5-th percentile89.98892395
Q1383.276166
median856.901546
Q31901.134317
95-th percentile6082.090595
Maximum50721.48336
Range50721.48336
Interquartile range (IQR)1517.858151

Descriptive statistics

Standard deviation2895.063757
Coefficient of variation (CV)1.670411693
Kurtosis54.77073581
Mean1733.143852
Median Absolute Deviation (MAD)581.3514605
Skewness5.907619794
Sum15511637.48
Variance8381394.157
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
02402.7%
 
806.5874821< 0.1%
 
836.8124141< 0.1%
 
139.6078271< 0.1%
 
107.2424081< 0.1%
 
433.8607141< 0.1%
 
402.200141< 0.1%
 
308.8864921< 0.1%
 
238.6951151< 0.1%
 
197.863491< 0.1%
 
Other values (8701)870197.2%
 
ValueCountFrequency (%) 
02402.7%
 
0.0495131< 0.1%
 
0.0564661< 0.1%
 
2.3895831< 0.1%
 
3.5005051< 0.1%
 
ValueCountFrequency (%) 
50721.483361< 0.1%
 
46930.598241< 0.1%
 
40627.595241< 0.1%
 
39461.96581< 0.1%
 
39048.597621< 0.1%
 

minimum_payments
Real number (ℝ≥0)

MISSING

Distinct count8636
Unique (%)> 99.9%
Missing313
Missing (%)3.5%
Infinite0
Infinite (%)0.0%
Mean864.2065423050827
Minimum0.019163
Maximum76406.20752000001
Zeros0
Zeros (%)0.0%
Memory size69.9 KiB

Quantile statistics

Minimum0.019163
5-th percentile73.2820058
Q1169.123707
median312.343947
Q3825.485459
95-th percentile2766.56331
Maximum76406.20752
Range76406.18836
Interquartile range (IQR)656.361752

Descriptive statistics

Standard deviation2372.446607
Coefficient of variation (CV)2.745231019
Kurtosis283.9899859
Mean864.2065423
Median Absolute Deviation (MAD)190.374096
Skewness13.62279699
Sum7464151.906
Variance5628502.901
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
299.3518812< 0.1%
 
284.706931< 0.1%
 
144.6418141< 0.1%
 
1927.8875471< 0.1%
 
6825.442031< 0.1%
 
342.4124761< 0.1%
 
5583.6304821< 0.1%
 
125.34941< 0.1%
 
3.197941< 0.1%
 
140.5961381< 0.1%
 
Other values (8626)862696.4%
 
(Missing)3133.5%
 
ValueCountFrequency (%) 
0.0191631< 0.1%
 
0.0377441< 0.1%
 
0.055881< 0.1%
 
0.0594811< 0.1%
 
0.1170361< 0.1%
 
ValueCountFrequency (%) 
76406.207521< 0.1%
 
61031.61861< 0.1%
 
56370.041171< 0.1%
 
50260.759471< 0.1%
 
43132.728231< 0.1%
 

prc_full_payment
Real number (ℝ≥0)

ZEROS

Distinct count47
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.15371464849162012
Minimum0.0
Maximum1.0
Zeros5903
Zeros (%)66.0%
Memory size69.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30.142857
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)0.142857

Descriptive statistics

Standard deviation0.2924991962
Coefficient of variation (CV)1.902871321
Kurtosis2.432395301
Mean0.1537146485
Median Absolute Deviation (MAD)0
Skewness1.942819941
Sum1375.746104
Variance0.0855557798
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0590366.0%
 
14885.5%
 
0.0833334264.8%
 
0.1666671661.9%
 
0.251561.7%
 
0.51561.7%
 
0.0909091531.7%
 
0.3333331341.5%
 
0.1941.1%
 
0.2830.9%
 
Other values (37)119113.3%
 
ValueCountFrequency (%) 
0590366.0%
 
0.0833334264.8%
 
0.0909091531.7%
 
0.1941.1%
 
0.111111610.7%
 
ValueCountFrequency (%) 
14885.5%
 
0.916667770.9%
 
0.909091190.2%
 
0.9160.2%
 
0.888889120.1%
 

tenure
Real number (ℝ≥0)

Distinct count7
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.51731843575419
Minimum6
Maximum12
Zeros0
Zeros (%)0.0%
Memory size69.9 KiB

Quantile statistics

Minimum6
5-th percentile8
Q112
median12
Q312
95-th percentile12
Maximum12
Range6
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.338330769
Coefficient of variation (CV)0.1162015947
Kurtosis7.694823186
Mean11.51731844
Median Absolute Deviation (MAD)0
Skewness-2.943017288
Sum103080
Variance1.791129248
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
12758484.7%
 
113654.1%
 
102362.6%
 
62042.3%
 
81962.2%
 
71902.1%
 
91752.0%
 
ValueCountFrequency (%) 
62042.3%
 
71902.1%
 
81962.2%
 
91752.0%
 
102362.6%
 
ValueCountFrequency (%) 
12758484.7%
 
113654.1%
 
102362.6%
 
91752.0%
 
81962.2%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

Sample

First rows

cust_idbalancebalance_frequencypurchasesoneoff_purchasesinstallments_purchasescash_advancepurchases_frequencyoneoff_purchases_frequencypurchases_installments_frequencycash_advance_frequencycash_advance_trxpurchases_trxcredit_limitpaymentsminimum_paymentsprc_full_paymenttenure
0C1000140.9007490.81818295.400.0095.400.0000000.1666670.0000000.0833330.000000021000.0201.802084139.5097870.00000012
1C100023202.4674160.9090910.000.000.006442.9454830.0000000.0000000.0000000.250000407000.04103.0325971072.3402170.22222212
2C100032495.1488621.000000773.17773.170.000.0000001.0000001.0000000.0000000.0000000127500.0622.066742627.2847870.00000012
3C100041666.6705420.6363641499.001499.000.00205.7880170.0833330.0833330.0000000.083333117500.00.000000NaN0.00000012
4C10005817.7143351.00000016.0016.000.000.0000000.0833330.0833330.0000000.000000011200.0678.334763244.7912370.00000012
5C100061809.8287511.0000001333.280.001333.280.0000000.6666670.0000000.5833330.000000081800.01400.0577702407.2460350.00000012
6C10007627.2608061.0000007091.016402.63688.380.0000001.0000001.0000001.0000000.00000006413500.06354.314328198.0658941.00000012
7C100081823.6527431.000000436.200.00436.200.0000001.0000000.0000001.0000000.0000000122300.0679.065082532.0339900.00000012
8C100091014.9264731.000000861.49661.49200.000.0000000.3333330.0833330.2500000.000000057000.0688.278568311.9634090.00000012
9C10010152.2259750.5454551281.601281.600.000.0000000.1666670.1666670.0000000.0000000311000.01164.770591100.3022620.00000012

Last rows

cust_idbalancebalance_frequencypurchasesoneoff_purchasesinstallments_purchasescash_advancepurchases_frequencyoneoff_purchases_frequencypurchases_installments_frequencycash_advance_frequencycash_advance_trxpurchases_trxcredit_limitpaymentsminimum_paymentsprc_full_paymenttenure
8940C19181130.8385541.000000591.240.00591.240.0000001.0000000.0000000.8333330.000000061000.0475.52326282.7713201.006
8941C191825967.4752700.833333214.550.00214.558555.4093260.8333330.0000000.6666670.6666671359000.0966.202912861.9499060.006
8942C1918340.8297491.000000113.280.00113.280.0000001.0000000.0000000.8333330.000000061000.094.48882886.2831010.256
8943C191845.8717120.50000020.9020.900.000.0000000.1666670.1666670.0000000.00000001500.058.64488343.4737170.006
8944C19185193.5717220.8333331012.731012.730.000.0000000.3333330.3333330.0000000.000000024000.00.000000NaN0.006
8945C1918628.4935171.000000291.120.00291.120.0000001.0000000.0000000.8333330.000000061000.0325.59446248.8863650.506
8946C1918719.1832151.000000300.000.00300.000.0000001.0000000.0000000.8333330.000000061000.0275.861322NaN0.006
8947C1918823.3986730.833333144.400.00144.400.0000000.8333330.0000000.6666670.000000051000.081.27077582.4183690.256
8948C1918913.4575640.8333330.000.000.0036.5587780.0000000.0000000.0000000.16666720500.052.54995955.7556280.256
8949C19190372.7080750.6666671093.251093.250.00127.0400080.6666670.6666670.0000000.3333332231200.063.16540488.2889560.006